Visual recognition based on temporal cortex cells: viewer-centred processing of pattern configuration.
Authors
Abstract
A model of recognition is described based on cell properties in the ventral cortical stream of visual processing in the primate brain. At a critical intermediate stage in this system, 'Elaborate' feature sensitive cells respond selectively to visual features in a way that depends on size (±1 octave) and orientation (±45 degrees) but not on position within central vision (±5 degrees). These features are simple conjunctions of 2-D elements (e.g. a horizontal dark area above a dark smoothly convex area). They can arise either as elements of an object's surface pattern or as a 3-D component bounded by an object's external contour. By requiring a combination of several such features without regard to their position within the central region of the visual image, 'Pattern' sensitive cells at higher levels can exhibit selectivity for complex configurations that typify objects seen under particular viewing conditions. Given that input features to such Pattern sensitive cells are specified in approximate size and orientation, initial cellular 'representations' of the visual appearance of object type (or object example) are also selective for orientation and size. At this level, sensitivity to object view (±60 degrees) arises because visual features disappear as objects are rotated in perspective. Processing is thus viewer-centred, and the neurones respond only to objects seen under particular viewing conditions, or 'object instances'. Combined sensitivity to multiple features (conjunctions of elements), independent of their position, establishes selectivity for the configuration of object parts (from one view), because rearranged configurations of the same parts yield images lacking some of the 2-D visual features present in the normal configuration. Different neural populations appear to be selectively tuned to particular components of the same biological object (e.g. face, eyes, hands, legs), perhaps because the independent articulation of these components gives rise to correlated activity in different sets of input visual features. Generalisation over viewing conditions for a given object can be established by hierarchically pooling the outputs of view-condition specific cells, with pooling operations dependent on the continuity in experience across viewing conditions. Different object parts are seen together, and different views are seen in succession when the observer walks around the object. The view specific coding that characterises the selectivity of cells in the temporal lobe can be seen as a natural consequence of selective experience of objects from particular vantage points. View specific coding for the face and body also has great utility in understanding complex social signals, something that may not be feasible with object-centred processing.
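The two mechanisms described in the abstract, position-independent conjunction of features and pooling across views, can be illustrated with a toy sketch. This is not the authors' implementation: the feature labels, the set-based image representation, and the function names `pattern_cell` and `pooled_cell` are all hypothetical, and simple OR pooling stands in for the experience-dependent pooling the abstract mentions.

```python
from typing import Callable, FrozenSet, Iterable, List

# An "image" is reduced to the set of 2-D features detected anywhere in
# central vision. Position is deliberately discarded (assumption of this sketch).
Features = FrozenSet[str]
Cell = Callable[[Features], bool]


def pattern_cell(required: Iterable[str]) -> Cell:
    """View-specific unit: fires only if every required feature is present."""
    required_set = frozenset(required)

    def respond(image_features: Features) -> bool:
        return required_set <= image_features

    return respond


def pooled_cell(view_cells: List[Cell]) -> Cell:
    """View-general unit: fires if any view-specific input fires (OR pooling)."""

    def respond(image_features: Features) -> bool:
        return any(cell(image_features) for cell in view_cells)

    return respond


# Hypothetical feature sets, for illustration only.
frontal_face = frozenset(
    {"dark_bar_above_dark_blob", "two_blobs_side_by_side", "blob_pair_above_bar"}
)
profile_face = frozenset({"dark_bar_above_dark_blob", "convex_edge_beside_blob"})
# Same parts rearranged: one conjunction feature is lost, a new one appears.
scrambled_frontal = frozenset(
    {"dark_bar_above_dark_blob", "two_blobs_side_by_side", "bar_above_blob_pair"}
)

frontal_cell = pattern_cell(frontal_face)
profile_cell = pattern_cell(profile_face)
face_cell = pooled_cell([frontal_cell, profile_cell])
```

In this sketch `frontal_cell` accepts the frontal feature set but rejects both the profile view and the scrambled arrangement, while `face_cell` accepts either view and still rejects the scrambled arrangement. Configuration selectivity therefore falls out of missing conjunction features, without any explicit coding of part positions.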
Similar resources
Frameworks of analysis for the neural representation of animate objects and actions.
A variety of cell types exist in the temporal cortex providing high-level visual descriptions of bodies and their movements. We have investigated the sensitivity of such cells to different viewing conditions to determine the frame(s) of reference utilized in processing. The responses of the majority of cells in the upper bank of the superior temporal sulcus (areas TPO and PGa) found to be sensi...
Visual Preferences of Small Urban Parks Based on Spatial Configuration of Place
The importance of small urban parks (SUP) in mega cities has been accepted as an essential component of urban lung and restorative settings. As urban population in the world increases and the cost of maintaining large parks escalates, urban authorities are shifting their attention to creating and maintaining smaller urban parks. However, SUP may present a different ambience due to their locatio...
Handbook of Pattern Recognition and Computer Vision, pp. 863–882: Viewer-centered Representations in Object Recognition: a Computational Approach
Visual object recognition is a process in which representations of objects are used to identify those objects in images. Recent psychophysical and physiological studies indicate that the visual system uses viewer-centered representations. In this chapter a recognition scheme that uses viewer-centered representations is presented. The scheme requires storing only a small number of views to repre...
Novel temporal configurations of stimuli produce discrete changes in immediate-early gene expression in the rat hippocampus.
Changes in limbic brain activity in response to novel configurations of visual stimuli were assessed by quantifying two immediate-early genes, c-fos and zif268. Rats were first trained to use distal, visual cues to support radial-arm maze performance. Two separate sets of visual cues were used, one in the morning (Set A) and the other in the afternoon (Set B). On the final day the experimental ...
Combining Multiple Views and Temporal Associations for 3-D object Recognition
This article describes an architecture for the recognition of three-dimensional objects on the basis of viewer centred representations and temporal associations. Considering evidence from psychophysics, neurophysiology, as well as computer science we have decided to use a viewer centred approach for the representation of three-dimensional objects. Even though this concept quite naturally sugges...
Journal title:
- Zeitschrift fur Naturforschung. C, Journal of biosciences
Volume 53, Issue 7-8
Pages: -
Publication date: 1998